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Abstract. We study the voter model and related random-copying processes on 
arbitrarily complex network structures. Through a representation of the dynamics 
as a particle reaction process, we show that a quantity measuring the degree of order 
in a finite system is, under certain conditions, exactly governed by a universal diffusion 
equation. Whenever this reduction occurs, the details of the network structure and 
random-copying process affect only a single parameter in the diffusion equation. The 
validity of the reduction can be established with considerably less information than 
one might expect: it suffices to know just two characteristic timescales within the 
dynamics of a single pair of reacting particles. We develop methods to identify these 
timescales, and apply them to deterministic and random network structures. We focus 
in particular on how the ordering time is affected by degree correlations, since such 
effects are hard to access by existing theoretical approaches. 



1. Introduction 

The voter model [IrS] is arguably the simplest nonequilibrium ordering process. Its 
dynamics are perhaps most easily described within its original formulation as a model 
for population dynamics [I]. Each site of a lattice (or network) is occupied by an 
individual belonging to one of a number of species. In each timestep, a site is chosen 
at random, and the individual at that site is replaced by an offspring (copy) of an 
individual selected at random from a neighbouring site. In a finite system, one of the 
possible absorbing states — in which only a single species remains — is eventually reached. 
From a physical perspective, the interest in this process is that order is arrived at purely 
through fluctuations, rather than driven by surface tension as in the Ising model (see 
e.g., 016]; also [7] for a recent review). 

Many variants of the basic voter model have been investigated, some quite recently: 
examples include those where multiple individuals occupy a site [8] , where the site chosen 
in each timestep contains the individual that is copied rather than replaced [HEED], or 
where different sites have different probabilities of being chosen for update [11] . Specific 
instances drawn from this wider class of random- copying models have applications in 
population genetics, ecology and cultural evolution, as reviewed in [12J. Despite their 
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simplicity, these models reproduce observed patterns of species diversity in ecosystems 
[T3] and of the cultural dynamics of features like baby names [14] . They also play an 
important role as null models of evolution in the absence of selection [T5] . 

A stochastic equation of motion for the frequency £ of a chosen species is 
straightforwardly obtained in the mean-field limit (i.e., when the system lacks spatial 
structure). More precisely, when each site is visited for update at a uniform rate m, and 
the individual landing on the target site is sampled uniformly from the entire population, 
the probability distribution of £ is — for sufficiently large systems — governed by 
f) m rP' 

Derivations of this equation can be found in many places, e.g., [T2l [THJ [TT] . These 
stochastic dynamics have three key properties. First, there is no deterministic (drift) 
term in the above equation, confirming that order is attained through fluctuations alone. 
Second, all timescales of the ordering process are proportional to the parameter N e : in 
the mean field, this parameter is simply equal to the number of sites, N. Finally, the 
full, time-dependent solution for P(£, t) from arbitrary initial condition is known [16J. 
Then, for example, the probability that complete order has been attained by time t is 
given by the combination P(0,t) + P(l,t). 

Many recent works have switched focus to models defined on nontrivial network 
structures, notably heterogeneous networks in which the degree can vary widely between 
nodes [9|fi miT8ti27] . Such networks are particularly relevant to cultural applications, 
where the interaction frequency between agents need not be purely a function of spatial 
proximity. Remarkably, it has been found that even in the presence of significant 
heterogeneity, the ordering dynamics is well- described by the mean- field equation (OQ). 
For example, the mean time to reach complete order as predicted by ([1]) has been found 
to apply in models with complex spatial structure [T0 II23H2T] . The only differences are: 
(i) that £ is a weighted average of a species frequency over the network; and (ii) the 
effective size, N e , need not equal the actual network size N (or even scale linearly with 
it). Thus, a good understanding of ordering behaviour within random-copying models 
on networks can be gained if one can identify when a reduction to ([1]) is possible, and if 
so, what the value of N e is. For example, the entire distribution of ordering times can 
be read off from the exact solution of (JTJ . 

In this work we will demonstrate that the mean-field diffusion equation (|Tj) is in fact 
exact for general random-copying processes on arbitrary networks at all but the very 
earliest times when there is a separation of emergent timescales within a two-particle 
coalescence reaction dynamics that is dual to the random- copying process [2Ell2Sj- In 
several earlier works [9|fi mi23ti27] . the equation (TjQ) was typically obtained by making 
specific assumptions and approximations, and the fundamental physical reason why the 
simple diffusion ([T]) should apply was not clearly identified. 

The central object in our analysis is the quasi-stationary distribution of a pair of 
particles that have survived for a long time without reacting. This state decays at 
some characteristic rate, and if its lifetime greatly exceeds the relaxation time to the 
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quasi-stationary state from a given initial condition, the model exhibits the separation 
of timescales that is required for the validity of ([T]). The effective size N e is then 
proportional to the single, dominant timescale in the reaction dynamics, that is, the 
lifetime of the quasi- stationary state. 

This paper is divided into two parts. In the first part (Sections EH5]) we define 
the general class of random-copying models that includes well-studied instances, like 
the voter model, as special cases, and derive our main results. These are: Eq. ([TJ, the 
condition for its validity and the expression for the fundamental parameter N e . As we 
will need to switch repeatedly between the forward-time formulation of the dynamics in 
terms of a diffusion equation and an equivalent backward-time particle reaction process 
whose details are not widely discussed in the physics literature, we must unfortunately 
contend with lengthy, but necessary, preliminaries in Sections [2HU Once this is done, 
the derivations of our main results follow in Section |5j We emphasise that the validity 
of Eq. ([[]) can be determined by considering only a single pair of coalescing random 
walkers on the network. In principle, ascertaining whether the system is completely 
ordered requires knowledge of the state of all N sites simultaneously: hence being 
able to determine whether the considerably simpler single-coordinate diffusion applies 
without having to solve the full dynamics is of great practical benefit. 

In the second part of this paper, we go beyond purely formal statements about 
random- copying dynamics and focus on strategies to estimate the emergent timescales 
of the two-particle reaction process. These methods are developed in Section E] and 
used to demonstrate the validity of the diffusion (pQ) for specific combinations of network 
structures and copying dynamics in Sectional Although these methods are restricted to 
random- copying processes that map onto time-reversal symmetric random walks, it turns 
out that the most widely-studied variations on voter-model dynamics [9| [T0l[20l[26| l27] 
fall into this category. On the other hand, we do not (in contrast to some existing 
treatments (9j,[T0l[23, 25-27J) invoke a coarse-graining over multiple sites of the network 
to obtain a diffusion equation. Thus, we can use our analytical methods to investigate 
the effects of degree correlations on the effective size N e that appears in Equation ([TJ, 
which until now has been accessible only by computer simulation. 

We remark that the idea that random-copying dynamics in the presence of spatial 
structure may be adequately represented by a mean-field model with some effective size 
N e has a very long history in population genetics, dating back to the work of Wright 
from the 1930s [30]. Furthermore, various separation of timescales arguments have 
been employed as a means to identify an effective size [STHSU] . In this work, our main 
contribution to this line of enquiry is to highlight the two particular emergent timescales 
that are relevant in determining the applicability (or otherwise) of a single effective size: 
their importance does not appear to have been fully recognised before. We furthermore 
show that this separation exists even in cases where it has been argued that there will 
be more than one relevant timescale in the dynamics, and that a description in terms of 
a single effective size will not apply [35]. We discuss these points in much more detail 
in the conclusion, Section |H1 along with a number of remaining open questions. 
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Figure 1. Random-copying dynamics on a network. Here, each site hosts M = 3 
individuals and there are four species in total. The individual with dotted outline on 
site i has been selected for replacement by copying an individual on site j. This event 
takes place as a Poisson process with rate /Lty . 

2. Definition of a class of spatially-structured random-copying models 

We consider a network of N nodes (or sites) labelled by the integers i — 1, 2, 3, . . . , N. 
Site i hosts a population of M individuals, each of which is a member of one of a 
number of species. The dynamics is specified in terms of a set of copying rates Each 
corresponds to an independent Poisson process! dut Felts Hij one of the M individuals 
on site j is copied, and the offspring individual replaces a randomly- chosen individual 
on site i. See Figure [U Note that the case i — j is legitimate, and indeed the rates 
fin will typically be nonzero in any particular instance of this model. We could also 
consider models where M varies from site to site within the framework presented below: 
however, this serves only to complicate certain definitions and expressions so we shall 
restrict the discussion to a spatially-uniform M. 

The rates /ijj determine the model's network structure. Elements of the adjacency 
matrix Aij, which indicate if sites i and j are neighbours, are zero if ^ = 0, and unity 
otherwise (except the diagonal elements An which are always taken to be zero). The 
degree Zi of node i can then be defined as 

z i = X! Aij ■ (2) 

j 

Although we no not require the adjacency matrix to be symmetric in the first part of 
this paper, we will for simplicity assume this is the case. (For example, we do not 
then have to distinguish between in-degree and out-degree). Note, however, that the 
symmetry = Aji does not imply that fiij = fiji. 

In a finite system, there is some probability that a species eventually goes extinct— 
this occurs if all its representatives have been replaced by individuals of other species. 
Over time, the number of species decreases until only a single one remains. This 
stationary absorbing state of complete order is referred to variously as consensus or 
fixation (in the sociodynamics and genetics literature respectively). 
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2.1. Voter, invasion and link dynamics 

The original voter model dynamics specifies that, in each timestep, a site is first chosen 
at random, and then a copy taken from one of its neighbours. This constrains the 
copying rates //y to have the form 

^ = ^ (3) 

where /i is the rate at which a site is chosen for update. This is, however, only one 
of a vast range of possible ways to construct a set of copying rates. Also discussed in 
the literature is the invasion process, whereby the parent site j is chosen uniformly at 
random in each timestep, and the offspring 'invades' a randomly chosen neighbour (see, 
e.g., [9j[T0]). This corresponds to the choice 

4 = ^ • (4) 

z j 

Another possibility is link dynamics [9lfl0] . in which an edge between two nodes is chosen 
at random, and the copy made in one of the two possible directions along that edge. 
One way to specify this process is through the rates 

4 = ^ (5) 

where z is the mean degree within the network. In networks where all nodes have 
the same dee; for all i,j, and these three choices are all equivalent. 

On heterogeneous networks, these different rules have been found to lead to different 
scalings of the effective size N e with the actual network size N. For concreteness, we 
will mostly consider the above copying schemes. However, we stress that many of our 
results are quite general, and are not restricted to these specific choices. 



2.2. Discrete- and continuous-frequency models 

In the original formulation of the voter model, there is one individual per site: M = 1. 
Then, the frequency of a given species on a site can take only one of two values: 
or 1. To obtain a diffusion equation for the copying dynamics without the need to 
average over multiple sites (for example, all sites of the same degree, as has been done 
elsewhere 0HIIII221I22H2Z]), we w ^ the limit M — > oo, so that the frequency Xj 
of a chosen species on a site becomes a continuous variable in the range [0, 1]. We will 
use the terminology discrete- frequency and continuous-frequency models to refer to the 
extreme cases M = 1 and M — > oo respectively. 

It is not a priori obvious that results obtained for continuous-frequency models will 
apply to their discrete-frequency counterparts. However, it turns out that for any fixed, 
finite M, discrete-frequency dynamics are recovered in a limit where the copying rates 
between sites, ^ where % j, vanish. This fact can be deduced from a convergence 
theorem for Markov chains due to Mohle [37] . The basic physical idea behind this 
result is that in the time it takes for an individual to arrive from a neighbouring site, 
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sufficiently many replacements have been made by sampling from the local population 
of M individuals that each site is in one of its locally absorbing states — that is, each 
site hosts only a single species. As a consequence, the frequency of each species on any 
chosen site is always close to or 1 (although globally it may be somewhere in between) 
and thus the discrete-frequency dynamics is recovered. 

The way in which the M = 1 discrete-frequency dynamics is obtained 



mathematically is summarised in |Appendix A[ We illustrate here with the concrete 



examples of the previous subsection. Discrete-frequency versions of voter, invasion or 
link dynamics can be obtained by taking the parameter fi that appears in ([SD^© to 
zero, whilst holding all fin at fixed, finite (but otherwise arbitrary) values. To recover 
the M — 1 version of the voter, invasion or link dynamics, with the unit of time defined 
such that each site is visited on average once per timestep, one must multiply all times 



by fi/M (see Appendix A). 



A key point is that for any finite M, no matter how large, the discrete-frequency 
dynamics are recovered for sufficiently small \i. That is, any difference between a set 
of frequencies {x{\ in the M > 1 model and their values in corresponding realisations 
of the M = 1 model can be made arbitrarily small by reducing //. In the next section, 
and the remainder of this work, we will be working in the M — » oo limit, such that 
all frequencies can be regarded as continuous variables. We will assume that results 
for discrete frequency models can then be obtained by subsequently taking fi — > 0. In 
other words, we assume that the limits M — > oo and \i — y commute. We do not 
know a priori that this assumption is valid. In practice, we find that we recover known 
results for the voter model, or good agreement with simulation, when we perform this 
procedure (see Section [7j). 

3. Forward-time formulation as a Fokker-Planck equation 

We now derive the Fokker-Planck equation for probability distribution over the N- 
dimensional space of local frequencies Xj of a chosen species. Although the method used 
to derive this equation, based on jump moments, is standard [38] , we will recapitulate 
it here as it central to the derivation Eq. ([I]) that follows in Section [5] below. The key 
point is that the Fokker-Planck equation has the expansion 

^ M =-E^! 1) (M,r)P(M,r) + (6) 

i 1 

^E^-4 2) ({^^({^},-) + --- 

where r is the time variable and the jump moments are defined as 

>) (U \ r\ - Krr, ([ X h(T + 5T)- Xil (T)]---[Xi n (T + 5T)-Xi n (T)]) 

In calculating the jump moments, the initial coordinates Xj(r) are assumed known and 
fixed, and the angle brackets denote an average over all possible realisations of the 
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dynamics in the time interval [r, r + 8t\. These averages can be computed from the 
underlying stochastic dynamical rules. 

Within the present class of random-copying models, a local frequency Xi may 
increase or decrease by an amount ■h in any single update. The continuous-time 
transition rates for these processes are 

W (n ->■ Xi + —) = (1 - n) E IMjXj (8) 

3 

W (xi ^ ^ ~ Tj) = x iY, M 1 - x i) ■ ( 9 ) 

3 

In the time interval [t, 8t], only events involving a single change to one of the Xj 
contributes to order 5t in the average in (J7J), for any value of n. Thus, for the first 
jump moment we have, 

a i 1] = E ^A x 3 - X i) . ( 10 ) 

3+i 

and for the second moments 



^ = ThH^3 \ X i( l ~ X l) + X A l ~ X i)\ ( U ) 



1 



M 2 



3 



(2) 

while a\j = for % ^ j, since two different x^ never change at the same time. 

If all Hij remain fixed as M — > oo, the second jump moments vanish relative to the 
first. The resulting Fokker-Planck equation then has only a drift (deterministic) term, 
and no diffusive component. Both terms are contribute in this limit if one takes /i^ to 
vanish as 1/M for all j ^ i, as is standard in population genetics [TiZllTo] . We thus define 
rescaled rates and q through 

a ■ ^ 

2 l=J 

By introducing a rescaled time t = r/M 2 we find as M — > oo the Fokker-Planck equation 
d ( d d \ 

—P({Xi},t) = J2 ( m V]te ~ mji dx~) ( X i- X i) P (i X i}^) + 

1 d 2 

^E '^ 1-1 ^ 1 '}'^ ( 13 ) 

The third and higher jump moments are all of order 1/M 3 and higher, and thus do 
not contribute in the M — > oo limit. Although this Fokker-Planck equation is strictly 
valid in this limit, it is understood that it serves as an excellent approximation for 
large but finite M [16J. Our experience suggests that M ~ 100 yields results consistent 
with infinite M (see Sect ion [7T21 for a comparison between finite M simulations and the 
infinite-M theory). 

In principle, to study the ordering dynamics of random-copying processes one would 
need to solve the full time-dependence of this N- dimensional diffusion. It is therefore 
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Figure 2. Backward-time formulation of the random-copying dynamics as a random 
walk process. Shown is the configuration of the system of Figure Q] after the copying 
event has occurred. Looking backward in time, the tagged individual on site i, shown 
with a solid fill, has a probability /ijj per unit time of having been copied from site j. 
The tagged individual thus hops to site j at rate fiij . 

apparent that the reduction to the single coordinate diffusion ([T]), when it is possible, 
represents a dramatic reduction in the complexity of the problem. 

4. Backward-time formulation as a particle reaction process 

As mentioned in the introduction, the copying process defined in Section [2] has an 
equivalent formulation as a particle reaction dynamics (see e.g. [2"8"||2"§] for mathematical 
formulations and [12] for a more detailed account from a statistical physics perspective 
than the brief outline we give here). This process is identified by looking backward in 
time at the history of an individual in the current configuration. At some point in the 
past, this individual was created by the copying process. The rate at which an individual 
currently on site i was created by copying an individual on site j is fiij/M, where the 
factor M appears because fiij is the total rate at which one of the M individuals on site 
i is replaced by an offspring arriving from site j. This individual we call the ancestor of 
that in the current configuration. As we look further back in time, the ancestor performs 
a random walk, hopping from site i to site j at rate fiij. See Figure [2j We will also refer 
to these ancestors as particles. 

One can also consider the ancestry of multiple individuals. These independently 
perform random walks, as just described. However there is also the possibility that at 
some point in the history, one ancestor is the direct offspring of one of the others. The 
rate at which one of a pair of ancestors occupying sites i and j was copied from the 
other is (fi^ + \Xji)jM 2 . This time, the factor M 2 appears because both parent and 
offspring have a probability of 1/M of being involved in the copying process. After this 
copying event, the two lines of ancestry merge: one individual is the common ancestor 
of the original pair. This interaction can be viewed as a particle coalescence reaction. 

To convert these rates into the system of rescaled time units defined in the previous 
section, we multiply them by M 2 . We then find that an ancestor hops from site i to j 
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at rate m^-, and a pair of particles on site % react at rate q. The rate for particles on 
different sites % 7^ j to react vanishes under this rescaling. 

Within this backward-time picture, it is important to realise that if one starts with 
an arbitrary pair of individuals in a finite system and looks far enough back in time the 
ancestor of one individual will at some point be the direct descendant of the other. That 
is, any two particles eventually react, leading to an overall reduction in the number of 
particles over time (i.e., fewer ancestors who contribute descendants to the present-day 
population). The steady state of the particle reaction process thus comprises a single 
common ancestor of any set of individuals chosen from the present-time population. 
If there is a single ancestor common to the whole population, then it follows that all 
individuals at the present time are of the same species — i.e., that complete order has 
been reached at the present time. Thus the stationary absorbing state of full order 
in the forward-time dynamics corresponds directly to the stationary state of a single 
particle in the backward-time picture. 

In the remainder of this work we need only to consider the backward-time dynamics 
starting from a single pair of particles. Two key properties of this reaction-diffusion 
process are the probability distributions Quit) that a single particle on site i at time 
t = occupies site k at time t and Q k e(t) that a pair of particles occupy sites k and t. 
Note that in the reaction dynamics, we take increasing time to mean further back into 
the past, with t = indicating the present time. When talking about a particle pair, 
we will consider the particles to be distinguishable, so that the configuration k£ is not 
the same as lk\ik^ I. These conventions make writing down the master equations for 
these probability distributions slightly more straightforward. 

The master equation governing the single-particle distribution function is 

= E [Qi(t)m ik - Q k (t)m ki ] = - £ Qi(t)M^ . (14) 
Here we have introduced the matrix 

M^ = \~ mk ^* (15) 
I Ej m ij i = k 

where the sign of the entries has been chosen so that the eigenvalues of all 
have nonnegative real parts. Assuming that the network is such that the stationary 
distribution Q k = lim^oo Q k (t) is unique, we have at late times 

Qk(t) ~ Qk + au k e-^ h (16) 

where is the eigenvalue of with the smallest, nonzero real part. 

Meanwhile, the joint probability distribution for a pair of unreacted particles is 
governed by the master equation 

^Qki(t) = Yl Qit( l ) m ik + E Qkj(t)rriji - Qkk(t)c k 5 ke . (17) 

i j 

The first two terms in this equation give the contributions to this probability from 
particles hopping into, or out of, sites k and I. The final term arises from the decrease 
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in the probability that the two particles have not yet reacted due to the coalescence 
reaction. We can write this equation more compactly as 

j t Qkt{t) = -Y,Qij(t)M%u (18) 

ij 

(2) 

where the matrix M\^ M is 

(2) 

M ij'u = -mikSjt - mjidik + Ci5 ijk i . (19) 

Here we use 5ijki as a shorthand for 5ij5jkSke, i.e., the object that is zero unless all four 
indices are the same, in which is it equals unity. 

On a finite network where the stationary distribution for a single particle is unique, 
the two particles will eventually react. Therefore Qke(t) vanishes as t — > oo. The matrix 
M^ 2 ' has eigenvalues whose real parts are all positive. The smallest eigenvalue A^ 
is strictly positive, and is equal to the asymptotic rate at which a pair of unreacted 
particles coalesce. The associated eigenvector we will denote Qke and normalise so that 
HkiQki = 1- This is the quasi-stationary distribution of the two-particle system. That 
is, if we condition on the two particles never having reacted, the probability they are 
found on sites k and I approach the stationary value Qm [3S]- The rate of relaxation 
to the quasi-stationary state is given by the real part of the second-smallest eigenvalue 
A^ 2 ). That is, at late times 

Quit) ~ Q M e- x * h + bv M e- x ' 2h . (20) 

As stated in the introduction, and to be demonstrated below, it is a separation of 
timescales between the relaxation to and decay of the quasi-stationary state, controlled 
by A^ 2 -* and A^ respectively, that allows for the reduction of the full Fokker-Planck 
equation ( TT3|) to the single-coordinate diffusion of Eq. (fTJ. 



5. Exact reduction to a single-coordinate diffusion equation 

Now that the forward- and backward-time formulations of the model defined in Section [2] 
have been set up, we may derive the single-coordinate diffusion ([TJ and establish the 
condition for its validity. This amounts to identifying the quantity £ whose ensemble- 
average is conserved by the dynamics, and calculating its second jump moment. The 
procedure is not hard, but it does involve several manipulations that require explanation. 
The key quantity £ has been identified a number of times before (see e.g., 

0HUIEO1E1]). It is 

= (21) 

i 

where we recall that Qi is the stationary distribution of the single-particle random walk, 
and Xi is the frequency of the species of interest on site i. To show that the diffusion 
equation ([1]) has no deterministic term, we calculate the first jump moment 

a ^ *) = 5 Qi fc* st ( 22 ) 
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By multiplying (1T3|) by Xi and integrating over all the x coordinates (see [2HH0]), we 
find 

±( Xl (t)) = i;n^)>-%(^))] = -T, M $ ) Mt)) ( 24 ) 

where is as defined in Eq. (lT5i) . Hence, 

= -e (e^4 } ) (**(*)> = o (25) 

j \ i ) 

where the second equality follows because Qi is the stationary distribution of the single- 
particle random walk dynamics. In other words, it is the left eigenvector of the matrix 
with eigenvalue zero. Thus the first jump moment of £ vanishes, as required. 
We now turn to the second jump moment of £. For brevity, we define Xi = Xi(t) 
and x\ = Xi{t + St). Then, 



a (2) (£,*)=£ Q<Qj ]im {K Xl ]f ^ (26) 

/ nfJ nfJ \ ry* , rf* . ry* . ( / rp \ rp . \ 

y 

Notice that we take all the Xj variables to have known values at time t. There is a 
slight complication here: only the value of £ is known at time t, not the individual £j. 
However, we will shortly find that at late times a^(£) depends only on £, and hence 
the resulting expression will apply no matter what the actual individual values of Xi are 
in any given realisation of the dynamics. 

As we have shown above, (£(£)) has a zero time derivative, so only the first term in 
the previous expression contributes to the second jump moment. To make progress, it 
is useful to introduce the quantity 

P^t) = { Xi {l - Xj )) + ((1 - Xi) Xj ) . (29) 

This is the probability that two individuals randomly selected from sites i and j are of 
different species. On a finite network, this quantity asymptotically decays to zero, and 
in fact, its time evolution is governed by the equation 

jP %3 {t) = -M% M P kt (i) (30) 

where is the transition matrix for the two-particle reaction process defined by 

Eq. (fT9"|) . One way to see this is to calculate the time derivative of (xiXj) by multiplying 
(113"]) by XiXj and integrating, as described previously. A more direct way is to consider 
events that cause the probability that two individuals are of different species to change 
in a time interval [t,t + St] (see, e.g., [41]). The rate at which Pij(t) changes due to a 
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parent on site k leaving an offspring on site i is mik[Pkj{t) — P%jif)\- Meanwhile, if both 
individuals are on the same site i, there is a probability q per unit time that one of 
them replaces the other (see Section H]), an event that causes the probability that the 
two individuals are distinct to decrease at rate CiPu(t). Summing over all events that 
change Pij we find 

which is equivalent to ( l30]h 

Since the time evolution of Pijit) and the particle pair distribution function Qke(t) 
discussed in Section H] are both determined by the same matrix M (2) , Pij{t) has a late- 
time expansion of the same general form as that given by Eq. (1201) for Quit)- That 
is, 

P tJ (t) ~ P^- X « h + cw i3 e- x " 2)t (32) 
Now let us suppose there is some limit, for example N — > oo, in which 



Ai 2) 



ReAl 2) 



->■ . (33) 



Then, under a further rescaling of time t' = X^H, we have in this limit at any t' > 



d 



P l3 (t') = P^-* =► = -^W) • ( 34 ) 

In other words, at times tl > 0, the system is assumed to be in the quasi-stationary 
state. To avoid too many different time units, we shall remain in the unsealed time 
t and assume that the quasi- st at ionarity condition fl3"3"|) is satisfied, and thus that 
±Pij{t) = -\^P t j(t) when t > 0. 

To complete the derivation of the second jump moment a^(£), we differentiate 
both sides of fl29|) with respect to time t, rearrange and substitute into fl28|) . We find 



« (2) (e,t) = fm) -It,q<q£ p m ( 35 ) 

ij 

A(2) 

= yEW^«) (36) 
A(2) 

= E QiQifc + ^ - 2 ^;) = A f - • (37) 

ij 

To arrive at the second line we used the fact that the average of £ has zero time derivative, 
that (T3"3"|) holds and t > 0. The derivation is completed using the definition of Pijit), 
Eq. fl29l) . and of £, Eq. (j21|) . As advertised, this second jump moment is a function only 
of the collective coordinate £. Using the expansion ([H]) we find that the motion of the 
collective coordinate £ is governed by the Fokker-Planck equation 

8 A (2) d 2 

^*) = i-^(l-0^). (38) 
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at any time t > in a limit where the condition f )33|) is satisfied. This is Eq. ([T]) with 
an effective size 

N e = ^ (39) 

Aq 

where we recall m is the baseline rate of the copying process. This concludes the 
derivation of the main result of this work. In the remaining sections we focus on 
strategies for determining when the key condition (I3"3"j) is satisfied, and for estimating 
the single parameter (or equivalently, N e ) that depends on the network structure 
and choice of copying dynamics when the diffusion (TjQ) or fl38l) is valid. 

We conclude this section with a couple of remarks on the derivation of 



Behaviour at time t = At very early times, and technically only at time t = in the 
limit where (1531 holds, there are additional contributions to the second jump moment. 
Whether looking forwards or backwards in time, their net effect is to project the initial 
condition onto the quasi-stationary state. To be more precise, if the leading eigenvectors 
of the matrix are normalised so that 

E% = 1, (40) 

ij 

the probability that two particles initially occupying sites i and j have not reacted by 
time t and occupy sites k and i is 

D(ij\k£;t) = PijQue-^* (41) 

if all but the quasi-stationary modes are neglected at times t > 0. By taking the limit 
t — > from above, and summing over all k and £, we find that is the probability 
that this particle pair does not react before the quasi-stationary state is reached. 

This immediate collapse onto the quasi-stationary state has implications on the 
initial condition for the diffusion equation ([I]), given an initial set of frequencies Xi(0). 
It is possible to calculate the variance in £ at the onset of quasi-stationarity by appealing 
to (129]) . In the simple, solvable examples discussed in Section [7] below where we find ([!]) 
to be valid (Wright's island model and the bipartite network), we find that for any given 
initial combination of Xj(0) a delta-function initial condition P(£, 0) = 5(£ — J2i Qi x i[®\) 
is appropriate. Whether this is generally the case is a question we leave open for future 
work. 

Diffusion of other collective coordinates It turns out that any collective coordinate of 
the form 

X {t) =$>iZi(*) (42) 

i 

where the Ui are an arbitrary set of positive weights that sum to unity, satisfies the 
diffusion equation (TjQ) as long as, in addition to fl33l) the condition 

A (2) 

— 3 ► (43) 



ReA^ 
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is also satisfied. 

This can be seen by repeating the above derivation of the jump moments with this 
more general set of weights. Ultimately, one finds 

a {1) (x,t) = f t ( X (t)} (44) 
a^\ X ,t) = (1 - 2x)^<x(*)> - \Z^jPiAt) ■ (45) 

ij 

The leading decay modes of the first jump moment is governed by the eigenvalue 
A^, while the modes corresponding to A^ and A^ contribute to the second. Thus 
we require both fl33l) and (J43]) in order to obtain a vanishing first jump moment, and a 
time-dependent second jump moment of the form 

a (2) (x,*) = Afx(l-x) (46) 

at times t > 0. 

Although all collective coordinates of the form ( I42p have the same diffusion 
equation — and crucially, the same effective size N e — when these conditions hold, we 
prefer the definition fl2T|) because its first jump moment vanishes at all times, not 
just in the quasi- stationary state, and only the two-particle decay modes contribute 
to the second, as opposed to a mixture of the one- and two-particle modes. As a 
consequence, the condition for validity of the diffusion ( 12Tj) is slightly weaker and 
cleaner, as it makes reference only to the two leading decay modes within the two- 
particle sector. Furthermore, the effect of the dynamics at time t = before the onset 
of the quasi-stationary state is harder to predict when the first jump moment does not 
vanish identically. 



6. Methods of analysis for models with time-reversal symmetry 

So far in this work, we have shown that if there is a separation of timescales in random- 
copying models on networks between the rate of relaxation to the quasi-stationary state 
and the lifetime of that quasi-stationary state, the exactly solvable diffusion equation ([1]) 
correctly describes the ordering dynamics at all but the earliest times. To demonstrate 
the validity of ([I]), and compute the crucial parameter N e , one must in principle find 
the two smallest eigenvalues of the two-particle reaction matrix and verify the 

condition (|33|) . Whilst this represents a major simplification of the full dynamics, there 
is of course no simple, general expression for these eigenvalues for an arbitrary set of 
transition rates and q. 

In this section, we describe simplifications that arise in models where the single- 
particle hopping process exhibits time- reversal symmetry (detailed balance). In 
particular, this allows one to relate the relaxation rate to the two-particle quasi- 
stationary state, \( 2 \ to the relaxation rate to the single-particle stationary state A^ 1 ). 
This leads to a weaker, but more easily verified, condition for the validity of the diffusion 
dl]). Furthermore, the symmetry allows one to develop a systematic variational method 
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to bound the asymptotic reaction rate from above, or equivalently, the effective size 
N e from below. We develop these methods in this section, and apply them to specific 
combinations of network structure and update rules in the next. 

6.1. Time-reversal symmetry and its consequences 

The stationary state of the single-particle random walk is time-reversal symmetric if the 
detailed balance condition 

Qi'iiii, Qjiiiji Vi,j (47) 

is satisfied. Although this may seem an overly restrictive constraint on the rates m^-, 
it applies, for example, if they have the general form = mAijaifij, where Ay is the 
adjacency matrix and cti and (3j are arbitrary positive site-dependent quantities. The 
three sets of rules discussed in Section 12.11 fall into this category, and so the methods 
we derive here will be applicable to these cases of interest. 

The relation (14"T|) implies a corresponding symmetry in the matrix M^: 

QiQjM^a = QkQtM^ii ■ ( 48 ) 
As a consequence of this symmetry, all eigenvalues A of are real, and for any right 
eigenvector 0^ the corresponding left eigenvector is oc QiQj(pfj [12] ■ The eigenvectors 
form an orthonormal set, i.e., 



E^*£<M&^ = tfA^A'. (49) 

ij ij 



In particular, this means that the key quantities and characterising the quasi- 
stationary state are related by Qij = AQiQjPij, where the constant A is fixed by the 
normalisation conditions J2ijQij — PijQij = 1- These properties are crucial in the 
analysis of the quasi-stationary state that follows. 

6.2. Relationship between one- and two-particle relaxation rates 

We now show that, as a consequence of fj4H|) . > and hence that the condition 
(143]) implies the condition (|33|) on the validity of the diffusion equation (pQ). The 
condition fj4"3l) may be easier to demonstrate than fl33l) . since the relaxation rate of the 
single-particle dynamics is in principle more accessible than that for the two-particle 
dynamics. Indeed there are methods to bound Xfi from above [43] which would provide 
a means to demonstrate that (143]) is satisfied, although we will not use them here. 

One way to show that A(, 2) > A^ is to use standard perturbation theory |42j . 
We assume we have the full set of eigenvalues and eigenvectors of the matrix for 
some combination of and q, and then increase the coalescence rate on site x by an 
amount 5c. The perturbation of is then 5M^ ke = 5c if i = j = k = £ = x, and 
zero otherwise. Then, for an eigenvalue A of we have from first order perturbation 
theory 



QiQj [y, ,J 



6X = 5c ^ju^q^m = 5c . (50) 
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This is a nonnegative quantity. Hence, in particular, must increase (or remain 
constant) as reaction rates c, are increased from zero. When all Cj = 0, the reactions 
have no effect, and the two particles independently evolve according to the single-particle 
process governed by M^. Hence, the second smallest eigenvalue of equals that of 
when all a = 0. Since A^ 2 -* cannot decrease as the reactions are turned on, we have 
> A^, as claimed. 

6.3. Variational approach for bounding the asymptotic coalescence rate 

Since the eigenvalue A^ that sets the characteristic timescale of the diffusion (pQ) is the 
smallest of a set of real eigenvalues, we can use a variational principle [12] to bound it 
from above. Let us denote such a bound A+. The utility of this is twofold. First, if one 
can show that in some limit 

OT ^y^°> (51) 

Ar Ar 

then the condition for the validity of (pQ) follows. Second, the expression for A + can 
be used as an estimate of the asymptotic reaction rate 

One can construct a bound A + from an arbitrary vector faj through the expression 

^(2) < ^ _ 52ijke QiQj4>ijM ij ke (j)ke 

which is obtained by expanding the vector 0^ as a sum over the eigenvectors of the 
matrix (see, e.g., (12] for a derivation). 

There are two key points here. First, this result does not rely on being able to 
solve the two-particle reaction process. Any choice of 4>ij leads to an expression for A+. 
Even if this choice departs dramatically from the exact quasi-stationary eigenvector P^ , 
it may still be possible to show that one of the ratios ( ]5Tj) vanishes, in turn assuring 
validity of ([I]). The second point is that, in the usual way, one can include variational 
parameters in the expression for and minimise A + with respect to these parameters 
to obtain the best estimate of A^ 2 ** available within with the space of vectors spanned 
by them. We will encounter explicit examples in the next section, details of which are 
in the Appendix. 



7. Applications 

In order to make concrete the ideas and methods developed in the previous two sections, 
we now discuss their application to specific combinations of copying dynamics and 
network structures. In addition to checking certain known limits, we will use the fact 
that our approach does not invoke a coarse-graining over multiple lattice sites to obtain 
new theoretical results for the effect of correlations on the effective size Ag. 
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7.1. Wright's island model 

A spatially-structured random-copying model that is prominent in the population 
genetics literature is Wright's island model [33]. In this model, each site receives an 
offspring from one of the other sites (chosen uniformly) at a constant rate m. That 
is, niij = jp-r f° r & 11 i 7^ 3- The coalescence rate is also taken to be uniform: q = c 
for all i. The limited spatial structure in this model admits exact calculation of the 
eigenvalues A^ and \^\ and therewith demonstration that (pQ) is exact in the N — > oo 
limit. Despite the venerability of Wright's island model, we believe that the validity of 
the diffusion equation (PQ) has not been shown explicitly before, although the resulting 
expression for the effective population size N e is well known (see e.g., [32]; also [35] for 
a review). 

We begin by examining the single-particle dynamics. Since all sites are equivalent, 
the stationary distribution is uniform: Qi = jj- The rate of relaxation to this steady 
state, A^, is the smallest nonzero eigenvalue of the eigenvector M-j = j^(N5ij — 1). 
One can verify by inspection that any N- dimensional vector will a single entry equal to 
1, another equal to —1 and the rest zero is an eigenvector of Since there are N — 1 

linearly- independent vectors of this form, all N — 1 nonzero eigenvalues are degenerate, 
and one finds 

AT 

A - = m a737 ■ ( 53 ) 

We now turn to the two-particle dynamics. It is possible to calculate A^ exactly. 
One way to do this is with the variational method described above. We note that in 
the quasi-stationary state, all knowledge of the initial condition is lost. Thus there 
are only two distinct two-particle configurations in the quasi-stationary state: one in 
which both particles are on the same site, and one in which they are on distinct sites. 
To construct the ansatz, we set all diagonal entries (pa = a and all off-diagonal entries 
equal to unity. The overall normalisation of the vector is unimportant, since this cancels 
in the numerator and denominator of (|52p . Thus we can without loss of generality fix 
one of the variational parameters and optimise the bound ( 152]) with respect to those 
that remain. Since the exact quasi-stationary eigenvector is obtained with a particular 
value of a, it follows that by minimising the right-hand of ( 15 2 p with respect to a will 
give us the exact decay rate of the quasi-stationary state. 

In the Appendix, we show that in the quasi-stationary state, a and A^ 2 -* are given 
by the solution of the pair of equations 

2m 

" = 2m + e-A?> (M) 

Although these equations admit two solutions for \^\ only one is positive. This is 

■>(N — -n 

(56) 



(2) _ 2mN + c(N - 
q ~ 2{N- 1) 



lmc(N 



\j [2mN + c(N - 
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We can now use condition f )43|) to identify limits in which the diffusion ([T]) with 
effective size N e = 2m/ X^ is valid. One is the fast-migration limit, — — > 0, for which 
exact results are available on arbitrary network structures jl6]. For Wright's island 
model we have 



Ai 2 ) N - 1 



c 



+ 



c 
m 



2^ 



(57) 



A« N 2 m 

indicating that in this limit, (|T|) is valid for any N. In this regime, A^ 2 ** ~ c/N. This 
is consistent with the general result obtained by other means jl6], and thus serves as a 
basic check of the methods derived here. 

For general c and m, we have in the infinite system size limit, iV — > oo, that 

A "- ^ ^Od). (58) 



\,W 2m + cN \N 2 , 

Applying the condition (|43p we find that the diffusion ([T]) is valid for any combination 
of m and c with an effective size N e = N(l + 2™). This is in agreement with the large- A 7 " 
expression for the effective population size quoted in the literature (see, e.g., [32|H5]). 
To achieve this agreement, however, one needs to match up the choices of time units 
and copying rates that are used in standard population genetics models: the correct 
choice has the literature effective size related to our own by = N e /Am and the 

reaction rate c = 1/2M where M is the subpopulation size. In these units, m is the 
fraction of individuals replaced by immigrants in each generation. What is perhaps 
more significant is that, as far as we are aware, the applicability of this single effective 
population size over extended timescales has not been demonstrated explicitly even for 
this extremely simple model, even though it is routinely assumed. 

We remark that in the limit m — > where discrete-frequency models, like the 
original voter model, are recovered (see section I2~2"j) . we find that N e = N, as expected. 
Thus at least in this case, the limits of infinite subpopulation size M and vanishing 
migration rates m indeed commute. 



7.2. Bipartite network 

The bipartite network, previously examined in [T0l|23j[26], is insightful because it is 
perhaps the simplest network structure for which the three rules of Section 12.11 are not 
equivalent. Furthermore, as we will see below, degree correlations can be relevant, in 
the sense that they can lead to departures from results previously obtained by assuming 
an absence of such correlations [TQldSlESlEZ] . 

The A^ nodes of the bipartite network are divided into two groups, A and B, with 
a number n& of the nodes in group A, and ub in group B. The rate m^ at which an 
individual i from group A is replaced from a copy of an individual j from group B is 
niij = m^B- At rate tuba, copying takes place in the reverse direction. As in Wright's 
island model, the rate c is taken to be the same on all nodes. Note that individuals are 
never copied between two nodes belonging to the same group. See Figure El 
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Figure 3. Bipartite network comprising two groups of nodes, A and B. Each site in 
group A may receive copies from any site in group B with equal probability per unit 
time, and vice versa. For clarity, the full set of edges has been shown only for the 
shaded nodes. 

Eigenvalue Degeneracy Left eigenvector 

A = 1 {m BA ; m AB ) 

X XA = n B m AB n A -\ ([1,-1]; 0) 

Xib = n A m BA n B - 1 (0; [1, -1]) 

A 2 = n A m BA + n B m BA 1 {-n B ; n A ) 



Table 1. Eigenvalues of the single-particle random walk on the bipartite network, 
(x; y) denotes a TV-dimensional vector whose first n A elements are given by x and the 
remaining tib elements by y. The notation [1, —1] indicates a vector of the required 
dimensionality that has one element equal to 1 , another equal to — 1 and the rest equal 
to 0. 



To analyse the copying dynamics on this network, we follow the strategy used above 
for Wright's island model, starting with the single-particle dynamics. One can verify by 
inspection that the eigenvalues and eigenvectors of the matrix have the structure 
presented in Table [U Note in particular that the elements of the zero left eigenvector 
give the stationary distribution Qt (up to a normalisation constant). Hence, 

{ m BA i / ■ / 
; 1 < i < n A 
n A m BA + n B m AB 
m AB ^ ■ , 
n A <i <n A + n B 
n A m BA + n B m AB 

Meanwhile, the second-smallest eigenvalue is 

A^. 1} = min{n A m BA ,n B m AB } . (60) 

To bound the decay rate of the quasi-stationary state, we make use of an 
extension to the variational ansatz described in the previous section (see Appendix for 
details). This extended ansatz still takes the quasi-stationary eigenvector (f)^ = 1 when 
two particles occupy different sites i ^ j. However, it allows for site-dependent values 
(pa = cti when both particles occupy the same site. The physical idea behind this ansatz 
is that the reaction process most strongly affects the probability of finding two particles 
on the same site, and has a weaker effect elsewhere. 

A key quantity in this analysis is the inverse participation ratio, which is defined 
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as 

£ = (6i) 

i 

Roughly speaking, X -1 measures the number of sites on which the particle may be 
found in the stationary state (see [47J for further discussion). If X vanishes as N — > oo, 
the stationary distribution is delocalised, and A^ 2 -* can be shown to be asymptotically 
bounded from above by 

(62) 

where rrii = m ij is the total rate at which a particle departs from site i, and Cj is the 
coalescence rate on site i. Furthermore, if A^_ also vanishes with N, the expression given 
above is the optimal bound on A^ within the space of quasi-stationary eigenvectors 
admitted by the ansatz. These result are derived the Appendix. We note also that (1621) 
coincides with an estimate for the effective size that we obtained previously by more 
approximate means [23]. 

In the case of the bipartite network, takes on two values: m ; = ub^ab if % is in 
group A and nii = n A m A s if i is in group B. Recalling that q = c is constant, we find 



x(2) < y* ^ ^n A nBm AB mBAC 
9 ~ + (n A m B A + n B m AB ) 2 



m B A m AB 



(63) 

2n B m AB + c 2n A m BA + c. 
if the inverse participation ratio vanishes with N. We can use this bound, along with 
( |60|) . in AST]) to establish cases where the diffusion equation (JTJ) is valid for the bipartite 
network. 

In fact, one can obtain A^ 2 -* exactly for the bipartite network by following the 
same procedure as for Wright's island model — that is, by recognising that there are 
five distinct two-particle configurations in the quasi- stationary state, and identifying 
each with a variational parameter. If one follows this path, one finds that when X 
vanishes, so does the ratio (}13"]) . and that A^ = A^ asymptotically. That is, we reach 
exactly the same conclusions with this more complicated approach, so we present only 
the results obtained using the simpler analysis just described. 

Of the three rules discussed in Section 12.11 the voter dynamics and invasion process 
are the most interesting. We discuss these two cases separately. 

7.2.1. Voter dynamics Within voter dynamics, we have 

m m 
mAB = — and tuba = — • (64) 
n B n A 

The rate of relaxation to the single-particle steady state is simply A^ 1 -* = m. Meanwhile 

the inverse participation ratio X = in ^ nB , which vanishes as long as both ha and 

n B = N — n A grow with iV as N — > oo. When this is the case (T5TT) is satisfied, 

and the asymptotic rate of decay of the quasi- stationary state is 

2mc _ 2mc N . . 
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Figure 4. Mean reaction rate per particle pair as a function of time for coalescing 
random walks on bipartite graphs with hop rates chosen to correspond to the voter 
model and infinite on-site reaction rate. Symbols show the mean reaction rate within 
an n-particle sector plotted at the mean time t n between entering and exiting that 
sector. These averages were taken over 100000 realisations of the reaction dynamics 
on any given network structure. Time has been rescaled according to ti to allow direct 
comparison of two different bipartite network structures: one has equal group sizes 
whilst in the other they differ greatly. 



In fact, since in the standard voter dynamics, the total rate of copying into a site, 
rrii, is the same for every site, is proportional to the inverse participation ratio on 
any network. We remark that in the limit — — > in which the traditional discrete- 
frequency voter model (see Section 12. 2j) is expected to be recovered, the result (1651) 
corresponds with that obtained for the voter model on bipartite networks obtained by 
other means [231126] . 

If, say, ua remains finite as N — > oo, the inverse participation ratio no longer 
vanishes. The stationary distribution is localised within the smaller group. As a 
consequence, condition (l5Tj) is no longer satisfied, suggesting (but not necessarily 
implying) that there is no separation of timescales in the two-particle dynamics. We 
test this explicitly with Monte Carlo simulations of the backward-time particle reaction 
dynamics. Initially, we place one particle on each site, and use the Gillespie algorithm 
[18] to determine the next particle to hop and the time that it does so. If two particles 
occupy a site simultaneously, they immediately coalesce: thus we are directly simulating 
the limit — — > 0. In Figure IH we show an empirical measure of the pairwise reaction rate 
as a function of time. This is achieved by taking the reciprocal of the mean time spent 
in the n-particle state (averaged over multiple runs), and dividing by \ Zj to convert the 
total reaction rate to a rate per particle pair. As can be seen from Figure HJ in the case 
where ha *C N, there is a significant transient in which the reaction rate departs from 
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Figure 5. Mean reaction rate per particle pair measured in computer simulations of 
coalescing random walks on bipartite graphs with hop rates chosen to correspond to the 
voter model. Circles show the case where one group has a fixed, finite size for different 
values of the total system size N; and other symbols cases where both group sizes are 
proportional to N. Error bars indicate the size of the change in rate over the history 
of the dynamics (see text for further discussion). The theoretical predictions for the 
asymptotic decay rate (|65p are shown as dashed lines. The discrepancies evident for 
the case of a fixed group size are expected within the theoretical treatment developed 
here (see text). 



its asymptotic value. 

Behaviour in the limit iV — > oo is revealed by Figure [5j Here, a fit to the asymptotic 
reaction rate as a function of iV is compared for the case n& = const versus oc N. This 
is done by fitting a constant function to data series like those shown in Figure HI The 
error bar indicates the standard deviation of the set of rates: this gives an indication of 
how much transients are contributing to the average. As can be seen, when n& remains 
constant as N increases, there is no evidence for a decrease in the contribution of the 
transient, suggesting that the early-time relaxational dynamics cannot be neglected in 
the limit iV — > oo, and thus that the diffusion (pQ) is indeed not valid in this case. 

We also examine the effect of intermediate subpopulation size M (see Section [2] for 
the precise definition) on ordering times. We recall that the limit M — > oo is used to 
obtain the Fokker-Planck equation ffT3]) . At finite M it is possible that additional terms 
may contribute. To investigate this, we performed direct Monte Carlo simulations of 
the forward-time random-copying dynamics specified in Section [2] with copying rates 
given by We took the overall rate of copying between sites fi = -h, so that then 
rriij = m = 1 and q = c = 2(1 — -p). Initially, half of the M individuals on each 
site was taken to be of the species of interest, so that the initial value of £ = |. In the 
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Figure 6. Mean ordering time for the voter model on the bipartite graph with 
subpopulation sizes M = 10, 20, 40 and 80 and group sizes ua and tib both proportional 
to the system size N . Each point was obtained from 500 realisations of the forward-time 
ordering dynamics from an initial condition comprising equal numbers of two species 
on each site (the exception is the point with the longest ordering time, M = 80, 
riA = Ub = 200, for which 200 samples were obtained). The dashed lines show 
predictions from the M — s- oo theory via Eq. (|66D . 



limit M — > oo, the effective size appearing in (PQ) is found from (IBl)]) to be 

8n A n B 

N e = — . (66) 

From a solution of a backward equation corresponding to ([I]), as described for example 
in [221119], one finds that the mean time T Q for complete order to be reached (i.e, for 
£ to reach or 1) from the initial condition £ = | is given by T Q = iV e m2. The 
result of performing many repetitions of the forward-time copying dynamics until order 
is reached, and averaging the ordering time, is shown in Figure |6] for various M. We 
see that even with M = 10, agreement with the M — > oo result is reasonable. Closer 
inspection of the data reveals that T Q is systematically overestimated in this small M 
regime; however, already for M = 80 the data are statistically indistinguishable from 
the infinite M prediction. Thus we find that even relatively small subpopulations are 
reasonably well-described by the continuous frequencies Xj. 

7.2.2. Invasion process In the invasion process, the hop rates of Eq. (164"|) are exchanged. 
The relaxation rate of the single-particle random walk is 

A« = mx minimi, ^1 . (67) 
[n B n A ) 

Meanwhile, the inverse participation ratio is given by 



(n\ + n 2 B ) 2 
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Figure 7. Mean reaction rate per particle pair measured in computer simulations of 
coalescing random walks on bipartite graphs with hop rates chosen to correspond to the 
invasion process. The A group had a size «a = aiV 7 , and the B group ub — N — ua- 
The choices of a and 7 used were (0,7) = (10, 0), (5, |), (|, |). The dashed lines show 
the prediction from Eq. (|6T)|) . As in Figure [SJ the error bars reflect the size of the 
contribution from the transient in the numerical estimate of the asymptotic decay rate 
of the quasi-stationary state. 



which vanishes as iV — > 00 for any combination of and as can be verified by 

taking ha ~ aN 1 with < 7 < 1 and % = N — ha- Then, (|63|) is asymptotically 

optimal. Using (l5"Tj) . we then find, again for any combination of ha and rig, that the 

single-coordinate diffusion ^ applies for the invasion process on the bipartite graph. 

For 7 < 1 in the above group-size definitions, the estimate for the decay rate of the 

quasi-stationary state takes the particularly simple large- iV form 

2ma , , 

A + ~ ^ ■ (69) 

We compare the predictions of this formula in the limit — — > against simulations 
of the type discussed above in the context of voter dynamics. As can be seen from 
Figure [TJ numerical measurements of the asymptotic decay rate of the quasi-stationary 
state agree well with the prediction from Eq. fl69|) . Again this provides evidence that 
the dynamics of discrete-frequency models is well-represented by the m — > limit of 
their continuous-frequency counterparts. 

We remark that the result (15^]) differs from that previously obtained within an 
approximation that assumes an absence of degree correlations in the network 1 1 j . Thus 
the invasion process on the bipartite network is a simple, illustrative example of a case 
where degree correlations must be taken into account to obtain the correct reduction onto 
a single-coordinate diffusion of the form (pQ). Although the correct result was obtained 
in [22], the separation of timescales required for its validity was assumed, rather than 
explicitly demonstrated. 
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Figure 8. Explosion operation that induces degree anti-correlations in a network. The 
exploded network (right) is constructed from the template network (left) by inserting 
a node (shown as squares) at the mid-point of each edge. 



7.3. Correlations in random network structures 

We now explore this last point in a little more detail by investigating a transformation to 
a network structure that induces degree anti-correlations. We achieve this by beginning 
with some template network and then 'exploding' it by inserting a node in the middle of 
each edge of the template. See Figure [HJ This way, every node of the template network 
is then next to a node of degree two: thus high-degree nodes end up next to low-degree 
nodes. 

Clearly, one consequence of this transformation is to increase the size of the network, 
with a likely corresponding increase in the effective size N e . Our aim in this final section 
is to demonstrate that degree ant i- correlations lead to a further increase in N e beyond 
what is predicted by (152]). 

As template networks we use Molloy-Reed networks [50] with an approximate 
power-law degree distribution. These networks are designed to have weak degree 
correlations. They are constructed by assigning to each node i its desired degree by 
attaching Zi half-edges to it. Then, these half-edges are formed into complete edges by 
picking a random pair and joining them together if the two nodes in question are not 
already connected. Once all half-edges are joined, the probability that nodes % and j 
are connected jK\ where 



where A 1 is a normalisation constant. For large z, this approaches the power-law 
distribution p z ~ . We shall however mostly be looking at the behaviour on relatively 
small networks: the purpose of using the above form for p z is to allow the possibility of 
highly heterogeneous networks (i.e., a large variance in the degree z). We shall also, for 
simplicity, restrict ourselves to voter dynamics, i.e., the case where m; = m on every 
node, and the limit — — > 0. Then, asymptotically, we have from (1551) and (IBTT) an upper 



(70) 



For this study, we use template networks with the degree distribution 




(71) 
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Figure 9. Empirical measurement of the asymptotic decay rate X q relative to the 
baseline A+ of Eq. (|72|) as a function of the size of the uncorrelated template network. 
Two different values of the exponent 7 were used in the power-law degree distribution 
(|71[) . 50 different random template networks were drawn from this distribution for each 
7, and the reaction dynamics repeated 100000 times on both the template and exploded 
network. In this figure, error bars indicate the standard error of the distribution of 
reaction rates within the random network ensembles not the standard error of the 



mean. 



bound 

}C 

\\ ~ 2ml = 2m-§ (72) 
M 

on \^\ since Qi = under voter dynamics [20j[23]. As previously, this bound is valid 
in the large N limit when I vanishes with N. 

In Figure |9] we show empirical measurements of the asymptotic decay rate of the 
quasi-stationary state, divided by (I72"|) . We used two degree distributions, one with 
7 = 2.5, and one with 7 = 3.5. In both cases the minimum degree of any node was 
4. For the original template we find deviations from this prediction of 10-20%. Since 
( 172]) was obtained by assuming an absence of correlations between particles occupying 
different sites in the quasi-stationary two-particle state, it would appear that (at least 
at the network sizes simulated) such correlations exist, despite the fact that degrees of 
neighbouring nodes are designed to be weakly correlated in Molloy-Reed networks. We 
also see that the deviations from (1721) increase dramatically when anti-correlations are 
introduced by the explosion procedure discussed above. Although we have not been 
able to demonstrate analytically that there is a separation of timescales in the dynamics 
on these networks, numerical data (not shown) suggest that the transient is irrelevant 
except possibly on the uncorrelated template network with 7 = 2.5. 

To understand more quantitatively the decrease of A^ (and hence increase of N e ) 
under the explosion operation, we employ a refinement of the ansatz used to obtain 
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r2jl that allows for correlations between unreacted particles on neighbouring sites in 
the quasi-stationary state. Details are presented in the Appendix. The bound on A^ 
that is ultimately obtained in the — — > limit is, for large N, 



/C 2 1 



K 2 



>t ~ 2m^ i _ JT^ , (73) 



S-TK\ 



where 



S=Y,Zi*i3 Z i ( 74 ) 
T = ^ ZiAijAj^Aki . (75) 

ijk 

Here, S is a measure of how much high-degree nodes tend to be connected to other high- 
degree nodes: it is related to the assortativity coefficient of a network [51] . Meanwhile, 
T comprises a sum over triangles in the network, weighted by the total degree of the 
vertices. It is thus similar in spirit to a clustering coefficient [52J in that it vanishes 
if the network contains no triangles, that is, if locally it has a tree-like structure. The 
way these quantities interact in (P73|) is unfortunately not straightforward to interpret. 
However, the deviation from the baseline (T72|) will typically be large when degrees of 
neighbouring nodes are anticorrelated and/or triangles tend to be formed by high-degree 
nodes. 

If nodes are uncorrelated, A^ can be replaced with its ensemble average value in 
the limit iV — > oo, that is, with the probability ZiZj/Ki that nodes i and nodes j are 
connected |23j . In this case, one finds asymptotically that A+ and A+ converge. On 
the other hand, the bounds are distinct after the explosion operation. Suppose the 
template network has given values of K,\ and K,2- Then, after the explosion, there are 
an additional /C/2 additional nodes of degree two. This implies that for the exploded 
network we have JC' = 2/C and £' = C + 2/C (where we use a prime to indicate the 
exploded network). To compute the statistic S', we note that in the exploded graph, 
each original node i is connected to nodes of degree two. Taking into account that 
the sum over all neighbouring pairs i, j can be written as twice the sum over all edges, 
and that each edge is connected to one of the original template nodes, we find that 

S' = A^z^=AC. (76) 

i 

By design, the exploded network contains no triangles, so T' = 0. Using these values in 
fTT3"|) we find, again in the — — >• limit, 

\t~2ml'( 3 --§ r -) , (77) 



,4 2/C 2 , 

which we can contrast with the baseline A^ ~ 2mX'. Relative to this baseline we see 
that the asymptotic decay rate of the quasi-stationary state is reduced by a factor of at 
least | as a consequence of the degree anti-correlations in the exploded networks. 

In Figure [TD] the same data as in Figure M are shown, but this time relative to the 
improved bound fT73|) . We see that in all cases, the improved bound is closer to the 
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Figure 10. The asymptotic decay rates of Figure |H] rescaled relative to the improved 
estimate A# of Eq. (|75|) . Error bars are to be interpreted as in Figure 



measured asymptotic decay rate, although significant discrepancies remain. The reason 
for this is that the ansatz for the quasi-stationary eigenvector <pij does not contain 
sufficient structure to capture all the relevant correlations between a pair of unreacted 
particles. However, the fact that the correction factors shown in Figures [10] and [9] are 
only weakly dependent on system size iV suggests that the correct scaling of the quasi- 
stationary reaction rate with N is captured by the expressions (172|) and ( I73|) . This is 
consistent with previous works in which the scaling of N e with N has been established 
through fairly simple approximation schemes [91 [TOl, [T9J [23] , further work having been 
necessary to improve on the prefactors (2H[27]. Further improvements demand a more 
detailed understanding of the structure of the quasi-stationary two-particle state. 

8. Discussion and outlook 

In this work, we have shown that the onset of order within a large class of random 
copying processes (including the voter model) on spatially-structured networks can be 
described exactly in terms of the diffusion of a single collective coordinate £. This 
quantity is a weighted average of a species frequency over nodes of the network. When 
the reduction onto the single-coordinate diffusion process is possible, the only significant 
effect of the spatial structure relative to the exactly-solvable 'mean-field' diffusion of 
Equation ([T]) is to replace the actual network size N with an effective size N e . 

As we have shown in Section [5j a remarkable feature of the analysis is that in order 
to establish the validity of the single-coordinate diffusion ([1]), one only needs to establish 
two properties of a two-particle reaction-diffusion process. The first is the lifetime of the 
quasi-stationary probability distribution for two unreacted particles. The second is a 
separation of timescales between this quasi- stationary lifetime and that of the relaxation 
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to quasi-stationarity. In principle, to establish the probability that complete order has 
set in by a time t, one would need to compute an iV-point correlation function, starting, 
for example, with the full iV-dimensional diffusion equation (|T3|) . The fact that one 
can show that the one- dimensional diffusion (pQ) is valid by appealing only to a two- 
particle problem represents a considerable simplification in the analysis of voter and 
other random-copying models. 

In Section [7] we demonstrated the separation of timescales within certain exactly- 
solvable models, namely, Wright's island model and on the bipartite graph. Although 
this property has previously been assumed with varying degrees of explicitness 
within these models [TO] , l2l | 126] . we are not aware of this property actually being 
confirmed for these models before. Furthermore, many existing treatments of random- 
copying dynamics on heterogeneous networks [9l [T0ll23|f25l42T] have derived a diffusion 
approximation by aggregating multiple sites in order to construct continuous spatial 
variables. Such approaches have the drawback that the network structure gets 'washed 
out' in the coarse-graining. Here we have described an alternative approach whereby a 
continuous variable is associated with each site, and the network structure is retained. 
As we have shown in Section [71 anti-correlations between the degree of neighbouring 
nodes leads in the example of the invasion process on a bipartite network to a scaling of 
the effective size N e with network size N that differs from that predicted by an approach 
that neglects these correlations [10]. For voter dynamics on random networks, the effect 
of degree correlations appears to be less severe, affecting the prefactor rather than the 
scaling exponent. Since the inverse participation ratio X seems to be the quantity that 
dominates the scaling of N e with iV in these models, the physical origin of the sublinear 
scaling of N c with iV that can be observed [9T fTT1, l2"3 j l2"6"] lies in the localisation of the 
quasi- stationary single-particle distribution Qi in the vicinity of high-degree nodes. 

In existing treatments of random-copying dynamics on heterogeneous networks, the 
methods used to arrive at the diffusion equation ([1]) have to some extent been ad-hoc. 
Here we have formulated a more systematic treatment based on timescales of the two- 
particle reaction-diffusion dynamics. The remaining task is to find good methods to 
estimate these timescales — and thereby demonstrate their separation. In cases where 
the single-particle reaction dynamics is time-reversal symmetric, we have shown that one 
can use the single-particle relaxation rate as a proxy for the corresponding quantity in 
the two-particle sector; we have also demonstrated the utility of a variational approach 
to bound the effective size N e from below. While this has allowed the separation of 
timescales to be demonstrated within very simple models, it was still necessary to 
resort to simulations to establish whether it was present on random network structures. 
Development of these methods — perhaps using bounds on the single-particle relaxation 
dynamics that have been derived elsewhere [43J — would be needed to determine the 
class of networks and dynamical processes for which ([1]) is valid. 

We remark that ([T]) is known to apply to the voter model on square lattices in two 
dimensions and above [28J. In two dimensions, it is found that the effective population 
size grows logarithmically with system size (relative to the rate at which a particle 
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relaxes to the stationary uniform distribution). This can be understood as a statement 
that domains coarsen logarithmically in the two-dimensional voter model [TJ. Although 
the separation of timescales is weak in this case, it is nonetheless present. Thus one 
would expect the separation of timescales to be observed on random networks with small 
diameters, on the grounds that a single particle would be expected to explore the system 
on a faster timescale than would be possible on a two-dimensional square lattice of the 
same size. It would be interesting to pursue this line of enquiry further. 

As mentioned in the introduction, the use of a single effective size N e to characterise 
purely stochastic population dynamics in subdivided populations is widespread in 
population genetics (see e.g., [15] for a review). A fair amount of discussion has 
centred around the fact that different definitions of N e are possible, depending on which 
property of the stochastic dynamics is measured and calibrated against an unstructured 
(mean- field) population [H] . Whilst it is widely understood that asymptotically many 
definitions converge onto the lifetime of the quasi-stationary state (also known as the 
eigenvalue effective size [53]), the importance of a separation of timescales between 
this lifetime and others seems to be less well established. For example, Sjodin et 
al [35], identify the asymptotic rate of particle coalescence as defining 'the' effective 
population size. In these analyses, they examine a separation of timescales in the 
definition of migration and coalescence rates, and q. That is, migration rates that 
vanish with local population size M with differing exponents. This approach potentially 
misses the possibility — demonstrated here in the examples of Section [7J — that one can 
have a separation of emergent timescales that is sufficient to allow a single effective 
size that characterises almost the entire history of the dynamics. The importance 
of emergent timescales has perhaps been most clearly recognised by Wakeley and co- 
workers [331 [3^1 [36] . I n particular, the rapid relaxation of the single-particle dynamics 
as a condition for a single, well-defined effective size was noted in [36J. However, 
that study was restricted to a rather special class of vertex-transitive networks and 
dynamical update rules. Furthermore, the irrelevance of all timescales other than the 
relaxation of the quasi- stationary state was implicitly assumed rather than explicitly 
demonstrated. We believe that the present analysis offers considerable additional insight 
into the applicability of a one-dimensional diffusion characterised by a single effective 
population size for a general class of random- copying dynamics on complex networks. 

Nevertheless, a number of open questions remain. As we have already noted, the 
task of calculating the relevant decay rates and A^ 2 -* is in general challenging (despite 
being simpler than solving the full iV-body dynamics). Although we have shown that 
certain simplifications arise when the single-particle random walk dynamics are time- 
reversal symmetric, there is of course a much larger class of processes that do not 
respect this symmetry: these have nonequilibrium steady states. Here we have not been 
able to show a result analogous to that derived in Section I6.2[ that is, to show that 
the relaxation rate to two-particle quasi-stationarity exceeds the relaxation rate to a 
single-particle nonequilibrium steady state, although intuitively one might expect this 
to be the case. Furthermore, we do not know of any convenient means to bound the 
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asymptotic reaction rate in these models. 

Although we have found that we recover known results for discrete-frequency 
models, or obtain good agreement with simulations, in the limit — — > 0, as we noted in 
Section I2.2[ this correspondence relies on the limits of infinite local subpopulation size 
and vanishing migration rate to commute. Again, we are not aware of a proof that this 
should be the case. At the end of Section [5] we commented that the initial condition on 
the diffusion equation may be modified as the system evolves into the quasi-stationary 
state: further clarity here would be beneficial. Additionally, although the coordinate £ is 
sufficient to track the degree of order present in a system, other aspects of the evolution 
cannot be determined from this quantity alone — for example, the typical state of the 
system or correlation length given some value of £. It is possible that some of these 
quantities can be obtained with knowledge of the quasi-stationary eigenvector fy, this 
by definition encapsulates two-point correlations. It is unclear whether higher-point 
correlation functions could be inferred from the two-point functions, or whether these 
would need to be determined separately. 

Finally, we revisit the fundamental property shared by the entire set of random- 
copying processes discussed here, namely, that all species are equivalent. There is 
considerable interest in those cases where one species is fitter than another. Although in 
some sense this is only a minor modification to the dynamics, the mathematics become 
much more involved. In particular, the backward-time dynamics are more complicated: 
the reaction rates then depend on the frequencies of different species in the population 
at the point in history at which the coalescence reaction takes place. It may, however, 
be possible to apply the results obtained here for the neutral theory in some kind of 
perturbation theory to study the effects of a small selective advantage. We leave this as 
one of many possibilities for future work. 
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Appendix A. Convergence to discrete-frequency models in the limit of slow 
copying rates 

In Section 12.21 it was noted that a convergence theorem due to Mohle [37] implies that 
if one takes the between-site copying rates fj,^ — > Vz ^ j at any fixed subpopulation 
size M, the discrete- frequency (M = 1) dynamics is recovered. In this appendix we 
provide an informal account of this mathematical result that illustrates how this limit 
is reached. We refer the reader to [37J for the technical detail. 

Our starting point is a discrete-time version of the random-copying process. At any 
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point in time, the state of the system, a, is specified by the set of numbers n, which 
count the number of individuals of the species of interest on site i. In any one timestep, 
one of these numbers may change by ±1. The idea now is to split the matrix A of 
transition probabilities that appears in the master equation P a i(t + 5t) = J2a A a > !<T P a (t) 
into two pieces. The first, L, accounts for local interactions: that is, when an individual 
is replaced by sampling from the same site. The second, /, accounts for interactions 
between sites. Both are stochastic matrices. If the latter happen a fraction e of the 
time, then 

A= (l-e)L + eI . (A.l) 
The theorem in [37] then states that the vector of probabilities P(t) has the limiting 

form 

P(n5t) ->• e enTIV VP(0) (A.2) 
as e — > 0, subject to the existence of the projector 

V = lim L n . (A.3) 

n— >oo 

The way to read (1A.2j) is as follows. The projector V performs an infinite number 
of local copying events on every site. If on site i there are initially n, individuals of the 
species of interest, then at the end of this process rij — > M with probability and 
Hi — > with probability 1 — -rj. Thus the initial condition is projected onto a state in 
which every site is occupied by individuals of a single species. The exponential term 
in (1A.2j) is the generator of a continuous-time process. If we take the limit 8t — > 0, a 
copy between sites is taken at rate e per timestep. A single copy may leave the system 
unchanged; or it may change one of the n, values by ±1. In the latter case, all but one 
of the individuals are of the same species. Then, the projector acts, either to restore 
the existing value of (with probability 1 — -h) or to 'flip' it (i.e., from M — > or vice 
versa) with probability -g. 

Putting this together, one finds that the frequency x% = ji flips from to 1 (or 
vice versa) due to a copy taken from a neighbouring site j at a rate 4r multiplied by the 
probability of copying from j to i in the original discrete-time process. In Section [21 the 
random- copying process was set up as a continuous-time process in which copies from j 
arriving at i at a rate fiij. This is obtained here by putting the transition probabilities 
encoded in / equal to Then, in the original time units, a copy between sites is 

taken at rate /i^- = jjfiij- Rescaling time such that t' = -pt, we obtain in the limit 
e — > the M — 1 (discrete frequency) dynamics from any model with M > 1. Note 
that in the case of the dynamical rules ((31)-© we can identify e with /i. 

Appendix B. Variational bounds on the quasi-stationary decay rate 



As stated in the main text, when the backward-time hopping process is time- reversal 
symmetric, all eigenvalues of the matrix are real, and hence the smallest eigenvalue 
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satisfies the bound 

^(2) < ^ _ ^ijki QiQ j ( t ) ij^ij\i'ritt . j, | j 



for any vector 0y. In order to construct bounds on A^ one can use a variational 
approach. That is, one can include various parameters in (p^, and minimise the right- 
hand side of (IB.lj) . thereby obtaining the most stringent bound on available within 
the space of vectors spanned by these parameters. 

The key to designing a variational ansatz is to recall that the quasi- stationary 
distribution of two unreacted particles oc QiQj(pij. Thus (p^ measures the extent 
to which the probability of finding a pair of particles on sites i and j is affected by 
the interaction between them. Alternatively, one can infer from the interpretation of 
Pij oc <pij given at the end of Section [5] that (py is the probability that two particles 
starting on sites i and j do not react before the onset of the quasi- stationary state. 
Either way anticipate that (pij will be close to unity for particles that are far apart, and 
decrease as their separation decreases. 

We make use of two variational ansatze in the main text. In the first it is assumed 
that <pij differs from unity only when both particles are on the same site. In the second 
we extend to include the effects of interactions when particles occupy neighbouring sites 
(but at the expense of restricting to voter- type update rules only). 

Ansatz invoking on-site interactions only Restricting the effect of interactions to 
particles occupying the same site, corresponds to an ansatz for (p^ of the form <p u = cti 
and (pij = 1 if i ^ j. Differentiating (IB.lj) with respect to a m , and setting it to zero, we 
find that 

(2) (2) 
Y.kl^mmM^kl T,ijkt QiQj4>ij Mij,ke4>ke . , . 

2 = v n n a? = + ' ^ > 

Ymm L^ij l *ti'*tj ( Pij 

Using the definition of M^ 2 \ Eq. (TT9]) . we find that 

Yl M mm,U^ = 2m i( a i ~ X ) + C i a i ( R3 ) 

ke 

where rrii = J^j^i^ij, the total exit rate from site i in the random- walk picture. 
Substituting into (IB . 2j) . we find that 

oti = — — . B.4 

2rrii + Q — A + 

This is not sufficient to determine the optimal cti, since the optimal value of A + is not 
known. To compute this, we need an additional relation between them. This can be 
obtained as follows. First, we note that ( TB.2j) can be written as 

Qi H M uli<Pkt = KQ 2 i<t>a ■ (B.5) 

kl 

Meanwhile, if we temporarily set (pij = a^+i for % ^ j, differentiate with respect to this 
additional a parameter and set to zero, we find analogously that 

£ QiQ 3 £ AfgUw = A + E QiQAi ■ (B-6) 
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Summing flB.51) over all i, and adding to this expression, we find that 

where the second equality holds because Qi is the zero left eigenvector of the matrix 
Using the definition (1T9]) and the form of the ansatz for </>y, we ultimately obtain 

a Si Qi CiOLi . . 

A + = i — ^ n2 /i v • l B -°) 

l - EiQi(l - «i) 

The optimal bound on is found by solving the simultaneous set of iV + 1 equations 
given by flRlj) and (IB~8l) . 

For Wright's island model, discussed in Section I7.1[ m ; , q and a; t are all site- 
independent quantities, and Qi = ■h. Making these substitutions in (IB.4[) and flB.8|) . we 
arrive at the pair of equations (1541) and fl55|) studied in the main text. 

For more general models, solving (1B.4|) and (IB.8P exactly is difficult. Nevertheless, 
one can still make some progress here. An alternative way to formulate this problem is 
to define 

» w = i^S«^K- < B - 9 > 

Then, the optimal bound A + is given by the value of x for which y(x) = x. 
Differentiating, we find 

Hence, the root of y(x) = x must lie at a value of y(x) < y(0); hence A + < y(0). Thus, 
one has that 

Note that the right-hand side of this expression is not necessarily the optimal bound. 

However, if the inverse participation ratio X = J2iQi vanishes as N — >• oo, then 
asymptotically we obtain the simpler expression 

K<K~ZQ!^ (B.12) 

which was used in the main text. This bound is optimal within the present ansatz for 
4>ij if A + also vanishes as N — > oo. Then, since 

A+ = !-£,« (B13) 

and an > „ 2m " , 

1 — 2rrii+Ci ' 

If both X = J2iQf an d A+ vanish as iV — > oo, this lower bound on the optimal A + 
asymptotically approaches the upper bound (IB. 12j) . and hence the right-hand side of 
(IB.12j) is optimal in this case. 
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Ansatz invoking on-site and nearest-neighbour interactions As noted in the main text, 
the traditional voter model dynamics is obtained by taking rrii — m and q = c followed 
by the limit ™ — > 0. In the above ansatz, this leads to all = 2 ^+ c an< ^ ^+ ^ ^° 
examine the effects of degree correlations between neighbouring nodes, it is natural to 
introduce an ansatz for faj that takes one value when both particles are on the same 
site, a different value when they are on neighbouring sites, and unity elsewhere. That 
is, 

fa = 1 + (a - l)6ij + {13- l)Aij (B.15) 

where A^ is the adjacency matrix for the network. Recalling that Aij = if % = j, we 
see that this expression takes the value a if % — j, (3 if % and j are neighbours, and unity 
otherwise. For voter dynamics, we have from (J3J) and (j!9p that 

M ijie = (jn5 ik - 7"^ fe ) Sje + { m ^ e ~ Y Aj ^j Sik + cSijke (B.16) 

where Zi is the degree of node i. 

Differentiating the right-hand side of (IB.ip with respect to a and setting to zero, 
we find 

E QlM^u = K E Qi<t>u (B. 17) 

E Q- [(2m + c)a - 2m/3] = A+ E Qi« • (B.18) 

i i 

Hence, 

2m/3 .„ , 

a = — — . B.19 

2m + c - A + v ; 

We notice the similarity with (1B.4[) : the on-site correlation (pa is multiplied by the 
nearest-neighbour correlation (3. To obtain an expression for this correlation, we 
differentiate the right-hand side of (IB. II) with respect to (3, and set to zero. This yields 

E QiAijQjM^^ju = A+E QiAjQj<Pij • (B.20) 

This expression can be evaluated by using the fact that, for the voter model, the 
stationary probabilities are Qi = Zi/K,\. The sum on the right-hand side becomes 

1 S 

^2 E ZiAjZ^ij = (3-^1 (B.21) 

where fC n and S are defined by Equations (ITU]) and fPT4l) respectively. The sum on the 
left requires a bit more work. One finds it can be written as 



2m / x — , ^ A^ 

%i Zj 

ij ijk 



i -2 I E ZiAijZj(j)ij ^^ZiAijZj- 



2m 
2m 

lq 



PS - E SiAjA,-* [1 + (a - 1)<J W + (/3 - 1)A W ] 

ijk 

™tfS-S-{a- 1)/C 2 - 08 - 1)T) (B.22) 
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in which T is given by (j75|) . Hence, 

2m [(P - \){S - T) - {a - l)JC 2 ] = X + f3S (B.23) 
Substituting (IB. 191) . and rearranging, we finally find 

= 7 1 S - IC2 ~ r . (B.24) 

\ 2m J 2m+c— A+ 

Similar manipulations allow one to determine that 

A = E« ZjfciC = adC 2 , , 

+ Y.ijZizrfij K,\ + (a - 1)/C 2 + {p - l)S ' 1 • j 

When the inverse participation ratio X vanishes with N, we know that A + does too, 
since flB.12j) bounds it from above, and is proportional to X under voter dynamics. The 
upshot of this is that in this case, we can neglect the terms in A + in (IB.19P and flB.24p . 
There are further simplifications in the limit ^ — > which is case studied in the main 
text. To leading order in — one finds 

a = ^ and = 1--?*-. (B.26) 
c S — / 

Using these in flB.251) . and further taking into account that the ratio X = §S§ is assumed 
to vanish as — » oo, we ultimately obtain the expression f [73|) quoted in the main text. 
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